Model Quantization, ONNX Runtime, Embedded Inference, TinyML

Yann LeCun’s VL-JEPA: The breakthrough that gives AI a "Mind's Eye" (instead of just a mouth).
hisohan.substack.com·2h·
Discuss: Substack
AI Ethics & Alignment
TOON for LLMs: A Comparative Performance Analysis against JSON
gist.github.com·5h·
Discuss: DEV
💬Prompt Engineering
is this legit? Supposedly LangVAE straps a VAE + compression algorithm onto any LLM image, reduces resource requirements by up to...
arxiv.org·3d·
Discuss: r/LocalLLaMA
📉Model Quantization
The Transformer Architecture: A Deep Dive into How LLMs Actually Work
dev.to·50m·
Discuss: DEV
📝NLP
The 2025 Guide to Machine Learning
ibm.com·1d·
Discuss: Hacker News
🔢Embeddings
Why A.I. Didn’t Transform Our Lives in 2025
newyorker.com·9h·
Discuss: r/TrueReddit
💬AI Code Assistants
A Farmer Doesn’t Know Coding, But Tries to Build an Executing Engine with LLMs and a Code Interpreter
reddit.com·8h·
Discuss: r/LocalLLaMA
💬Prompt Engineering
Show HN: Why is ML inference still so ad-hoc in practice?
news.ycombinator.com·1d·
Discuss: Hacker News
🧩LLM Integration
Show HN: Chat-DeepAI – DeepSeek pricing and getting-started guides (fan project)
chat-deepai.com·7h·
Discuss: Hacker News
🔊Text-to-Speech
wwes4/AI_Accel_1.5x: AI acceleration framework for ~1.5x speedups in mid-sized models via tension-based pruning. Built utilizing xAI's Grok.
github.com·1d·
Discuss: Hacker News
📉Model Quantization
Differentially Private Federated Learning: A Client Level Perspective
paperium.net·20h·
Discuss: DEV
🔒Differential Privacy
Introducing the XLab AI Security Guide
lesswrong.com·3h
🛡️AI Security
Streamlinear, a new MCP for Linear
blog.fsck.com·20h
🏔️Alpine.js
Your Team Uses AI. Why Aren't You 10x Faster?
bits.logic.inc·1h·
Discuss: Hacker News
AI-Driven DevOps
Book Review: Why Machines Learn
philippdubach.com·20h·
Discuss: Hacker News
🗂️Vector Databases
Attention from First Principles
metaworld.me·3d·
🧱Chunking
The $20 Billion Strategic Warning Shot: Why NVIDIA Fused the LPU into the CUDA Empire
dev.to·15h·
Discuss: DEV
Performance Engineering
Federated Machine Learning and the Future of Data Privacy
vrize.com·1d·
Discuss: DEV
🔒Digital Privacy
This Week in AI: Key Insights from the Latest Podcast Conversations
dev.to·2d·
Discuss: DEV
💬AI Code Assistants